Search CORE

49 research outputs found

Online Deception Detection Refueled by Real World Data Collection

Author: Caverlee James
Dai Zeyu
Huang Ruihong
Yao Wenlin
Publication venue
Publication date: 28/07/2017
Field of study

The lack of large realistic datasets presents a bottleneck in online deception detection studies. In this paper, we apply a data collection method based on social network analysis to quickly identify high-quality deceptive and truthful online reviews from Amazon. The dataset contains more than 10,000 deceptive reviews and is diverse in product domains and reviewers. Using this dataset, we explore effective general features for online deception detection that perform well across domains. We demonstrate that with generalized features - advertising speak and writing complexity scores - deception detection performance can be further improved by adding additional deceptive reviews from assorted domains in training. Finally, reviewer level evaluation gives an interesting insight into different deceptive reviewers' writing styles.Comment: 10 pages, Accepted to Recent Advances in Natural Language Processing (RANLP) 201

arXiv.org e-Print Archive

Crossref

Weakly-supervised Learning Approaches for Event Knowledge Acquisition and Event Detection

Author: Yao Wenlin
Publication venue
Publication date: 27/04/2021
Field of study

Capabilities of detecting events and recognizing temporal, subevent, or causality relations among events can facilitate many applications in natural language understanding. However, supervised learning approaches that previous research mainly uses have two problems. First, due to the limited size of annotated data, supervised systems cannot sufficiently capture diverse contexts to distill universal event knowledge. Second, under certain application circumstances such as event recognition during emergent natural disasters, it is infeasible to spend days or weeks to annotate enough data to train a system. My research aims to use weakly-supervised learning to address these problems and to achieve automatic event knowledge acquisition and event recognition. In this dissertation, I first introduce three weakly-supervised learning approaches that have been shown effective in acquiring event relational knowledge. Firstly, I explore the observation that regular event pairs show a consistent temporal relation despite of their various contexts, and these rich contexts can be used to train a contextual temporal relation classifier to further recognize new temporal relation knowledge. Secondly, inspired by the double temporality characteristic of narrative texts, I propose a weakly supervised approach that identifies 287k narrative paragraphs using narratology principles and then extract rich temporal event knowledge from identified narratives. Lastly, I develop a subevent knowledge acquisition approach by exploiting two observations that 1) subevents are temporally contained by the parent event and 2) the definitions of the parent event can be used to guide the identification of subevents. I collect rich weak supervision to train a contextual BERT classifier and apply the classifier to identify new subevent knowledge. Recognizing texts that describe specific categories of events is also challenging due to language ambiguity and diverse descriptions of events. So I also propose a novel method to rapidly build a fine-grained event recognition system on social media texts for disaster management. My method creates high-quality weak supervision based on clustering-assisted word sense disambiguation and enriches tweet message representations using preceding context tweets and reply tweets in building event recognition classifiers

Texas A&M Repository

A Graph-based Approach for Detecting Critical Infrastructure Disruptions on Social Media in Disasters

Author: Fan Chao
Huang Ruihong
Mostafavi Ali
Yao Wenlin
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2019
Field of study

The objective of this paper is to propose and test a graph-based approach for detection of critical infrastructure disruptions in social media data in disasters. Understanding the situation and disruptive events of critical infrastructure is essential to effective disaster response and recovery of communities. The potential of social media data for situation awareness during disasters has been highlighted in recent studies. However, the application of social sensing in detecting disruptions of critical infrastructure is limited because existing approaches cannot provide complete and non-ambiguous situational information about critical infrastructure. Therefore, to address this methodological gap, we developed a graph-based approach including data filtering, burst time-frame detection, content similarity and graph analysis. A case study of Hurricane Harvey in 2017 in Houston was conducted to illustrate the application of the proposed approach. The findings highlighted the temporal patterns of critical infrastructure events that occurred in disasters including disruptive events and their adverse impacts on communities. The findings also provided insights for better understanding critical infrastructure interdependencies in disasters. From the practical perspective, the proposed methodology study can improve the ability of community members, first responders and decision makers to detect and respond to infrastructure disruptions in disasters

Crossref

ScholarSpace at University of Hawai'i at Manoa

AIS Electronic Library (AISeL)

A Stitch in Time Saves Nine: Detecting and Mitigating Hallucinations of LLMs by Validating Low-Confidence Generation

Author: Chen Jianshu
Varshney Neeraj
Yao Wenlin
Yu Dong
Zhang Hongming
Publication venue
Publication date: 08/07/2023
Field of study

Recently developed large language models have achieved remarkable success in generating fluent and coherent text. However, these models often tend to 'hallucinate' which critically hampers their reliability. In this work, we address this crucial problem and propose an approach that actively detects and mitigates hallucinations during the generation process. Specifically, we first identify the candidates of potential hallucination leveraging the model's logit output values, check their correctness through a validation procedure, mitigate the detected hallucinations, and then continue with the generation process. Through extensive experiments with the 'article generation task', we first demonstrate the individual efficacy of our detection and mitigation techniques. Specifically, the detection technique achieves a recall of 88% and the mitigation technique successfully mitigates 57.6% of the correctly detected hallucinations. Importantly, our mitigation technique does not introduce new hallucinations even in the case of incorrectly detected hallucinations, i.e., false positives. Then, we show that the proposed active detection and mitigation approach successfully reduces the hallucinations of the GPT-3 model from 47.5% to 14.5% on average. In summary, our work contributes to improving the reliability and trustworthiness of large language models, a crucial step en route to enabling their widespread adoption in real-world applications

arXiv.org e-Print Archive

Weakly-supervised Fine-grained Event Recognition on Social Media Texts for Disaster Management

Author: Huang Ruihong
Mostafavi Ali
Saravanan Shiva
Yao Wenlin
Zhang Cheng
Publication venue: 'Association for the Advancement of Artificial Intelligence (AAAI)'
Publication date: 03/04/2020
Field of study

People increasingly use social media to report emergencies, seek help or share information during disasters, which makes social networks an important tool for disaster management. To meet these time-critical needs, we present a weakly supervised approach for rapidly building high-quality classifiers that label each individual Twitter message with fine-grained event categories. Most importantly, we propose a novel method to create high-quality labeled data in a timely manner that automatically clusters tweets containing an event keyword and asks a domain expert to disambiguate event word senses and label clusters quickly. In addition, to process extremely noisy and often rather short user-generated messages, we enrich tweet representations using preceding context tweets and reply tweets in building event recognition classifiers. The evaluation on two hurricanes, Harvey and Florence, shows that using only 1-2 person-hours of human supervision, the rapidly trained weakly supervised classifiers outperform supervised classifiers trained using more than ten thousand annotated tweets created in over 50 person-hours.Comment: In Proceedings of the AAAI 2020 (AI for Social Impact Track). Link: https://aaai.org/ojs/index.php/AAAI/article/view/539

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization

Author: Brahman Faeze
Chaturvedi Snigdha
Song Kaiqiang
Yao Wenlin
Yu Dian
Zhao Chao
Publication venue
Publication date: 28/06/2023
Field of study

Narrative summarization aims to produce a distilled version of a narrative to describe its most salient events and characters. Summarizing a narrative is challenging as it requires an understanding of event causality and character behaviors. To encourage research in this direction, we propose NarraSum, a large-scale narrative summarization dataset. It contains 122K narrative documents, which are collected from plot descriptions of movies and TV episodes with diverse genres, and their corresponding abstractive summaries. Experiments show that there is a large performance gap between humans and the state-of-the-art summarization models on NarraSum. We hope that this dataset will promote future research in summarization, as well as broader studies of natural language understanding and generation. The dataset is available at https://github.com/zhaochaocs/narrasum.Comment: EMNLP Findings 202

arXiv.org e-Print Archive